Instance-Based Online Learning of Deterministic Relational Action Models

Authors

Joseph Z. Xu

John E. Laird

Abstract

We present an instance-based, online method for learning action models in unanticipated, relational domains. Our algorithm memorizes pre- and post-states of transitions an agent encounters while experiencing the environment, and makes predictions by using analogy to map the recorded transitions to novel situations. Our algorithm is implemented in the Soar cognitive architecture, integrating its task-independent episodic memory module and analogical reasoning implemented in procedural memory. We evaluate this algorithm’s prediction performance in a modified version of the blocks world domain and the taxi domain. We also present a reinforcement learning agent that uses our model learning algorithm to significantly speed up its convergence to an optimal policy in the modified blocks
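As a rough illustration of the memorize-then-map idea described in the abstract, the Python sketch below stores transitions as sets of relational facts and predicts a new outcome by renaming the objects of a stored transition onto the novel state. It is not the authors' Soar implementation: the class `InstanceModel`, its methods, the tuple encoding of facts and actions, and the brute-force search over object mappings (a simple stand-in for analogical mapping) are all assumptions made for illustration.

```python
from itertools import permutations

# Hypothetical encoding: a fact is a tuple like ("on", "A", "B"),
# an action is a tuple like ("move", "A", "C"), object names are strings.


def _objects(facts):
    """Collect every object name mentioned in a set of facts."""
    return {arg for fact in facts for arg in fact[1:]}


def _rename(facts, mapping):
    """Rewrite facts by renaming objects; unmapped objects are kept as-is."""
    return frozenset(
        (f[0],) + tuple(mapping.get(a, a) for a in f[1:]) for f in facts
    )


def _mappings(pre, act, state, action):
    """Yield injective object mappings from a stored episode to a new state."""
    if len(act) != len(action):
        return
    # Action arguments must correspond position by position.
    base = dict(zip(act[1:], action[1:]))
    consistent = all(base[a] == b for a, b in zip(act[1:], action[1:]))
    if not consistent or len(set(base.values())) != len(base):
        return
    src = sorted(_objects(pre) - set(base))
    dst = sorted(_objects(state) - set(base.values()))
    # Brute force over the remaining objects (exponential; sketch only).
    for perm in permutations(dst, len(src)):
        yield {**base, **dict(zip(src, perm))}


class InstanceModel:
    def __init__(self):
        self.episodes = []  # memorized (pre_state, action, post_state)

    def record(self, pre, action, post):
        self.episodes.append((frozenset(pre), tuple(action), frozenset(post)))

    def predict(self, state, action):
        """Return the predicted next state, or None if nothing matches."""
        state, action = frozenset(state), tuple(action)
        best_score, best_state = -1, None
        for pre, act, post in self.episodes:
            if act[0] != action[0]:
                continue  # different action name
            for m in _mappings(pre, act, state, action):
                # Score a mapping by how many stored facts hold in the new state.
                score = len(_rename(pre, m) & state)
                if score > best_score:
                    added = _rename(post - pre, m)
                    removed = _rename(pre - post, m)
                    best_score = score
                    best_state = (state - removed) | added
        return best_state
```

For example, after recording a single transition in which block `A` is moved from `B` onto `C`, calling `predict` on a state with differently named blocks `X`, `Y`, `Z` and the action `("move", "X", "Z")` reuses the stored added and removed facts under the mapping `A→X`, `B→Y`, `C→Z`. Because only one outcome is stored per matching instance, the sketch assumes deterministic transitions, in line with the title.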

Similar resources

Efficient Learning of Relational Models for Sequential Decision Making

Abstract of the dissertation "Efficient Learning of Relational Models for Sequential Decision Making" by Thomas J. Walsh (dissertation director: Michael L. Littman). The exploration-exploitation tradeoff is crucial to reinforcement-learning (RL) agents, and a significant number of sample complexity results have been derived for agents in propositional domains. These results guarantee, with high probability, ...

From Non-Deterministic to Probabilistic Planning with the help of Statistical Relational Learning

Using machine learning techniques for planning has become increasingly important in recent years. Various aspects of action models can be induced from data and then exploited for planning. For probabilistic planning, natural candidates are learning of action effects and their probabilities. For expressive formalisms such as PPDDL, this is a difficult problem since they can introduce easily...

Online Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features

Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...

Relational Instance Based Regression for Relational Reinforcement Learning

The full paper on this topic appears in the Proceedings of the Twentieth International Conference on Machine Learning [1]. Q-learning [6] is a model-free approach to tackle reinforcement learning problems which calculates a Quality or Q-function to represent the learned policy. The Q-function takes a state-action pair as input and outputs a real number which indicates the quality of that action ...

A Higher Order Online Lyapunov-Based Emotional Learning for Rough-Neural Identifiers

To enhance the performance of rough-neural networks (R-NNs) in system identification, a new stable learning algorithm based on emotional learning is developed for them. This algorithm facilitates the error convergence by increasing the memory depth of R-NNs. To this end, an emotional signal as a linear combination of identification error and its differences is used to achie...

Publication date: 2010
